The Construction of a Chinese Collocational Knowledge Resource and Its Application for Second Language Acquisition
Abstract
The appropriate use of collocations is a challenge in second language acquisition. However, high-quality, easily accessible Chinese collocation resources are not available to either teachers or students. This paper presents the design and construction of a large-scale resource of Chinese collocational knowledge and a web-based application, OCCA (Online Chinese Collocation Assistant), which offers a free and convenient collocation search service to end users. We define and classify collocations according to practical language acquisition needs and use a syntax-based method to extract nine types of collocations. In total, 37 extraction rules are compiled from word, POS, and dependency relation features; 1,750,000 collocations are extracted from a corpus for L2 learning and complementary Wikipedia data; and OCCA is built on these extracted collocations. Comparing OCCA with two traditional collocation dictionaries, we find that OCCA has higher entry coverage and a larger number of collocations, and that our method achieves an error rate below 5%. We also discuss how to apply collocational knowledge to grammatical error detection and demonstrate performance comparable to the best results in the 2015 NLP-TEA CGED shared task. The preliminary experiment shows that collocational knowledge is helpful in detecting all four types of grammatical errors.
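As a rough illustration of what a syntax-based extraction rule over word, POS, and dependency features might look like, the sketch below matches head-dependent pairs in a parsed sentence against a small rule table. The abstract does not list the 37 rules or name the nine collocation types, so the rule table, the tagset, the relation labels, and the names `Token`, `RULES`, `extract_collocations`, and `count_collocations` are all assumptions made for illustration, not the authors' implementation.

```python
from collections import Counter
from typing import NamedTuple


class Token(NamedTuple):
    idx: int     # 1-based position in the sentence
    word: str    # surface form
    pos: str     # coarse POS tag (hypothetical tagset)
    head: int    # index of the head token, 0 for the root
    deprel: str  # dependency relation to the head (hypothetical labels)


# Hypothetical rules: (relation, head POS, dependent POS) -> collocation type.
# The paper's actual rules and type names are not given in the abstract.
RULES = {
    ("dobj", "VERB", "NOUN"): "verb-object",
    ("amod", "NOUN", "ADJ"): "adjective-noun",
    ("advmod", "VERB", "ADV"): "adverb-verb",
}


def extract_collocations(sentence):
    """Return (type, head word, dependent word) for every arc matching a rule."""
    by_idx = {tok.idx: tok for tok in sentence}
    found = []
    for tok in sentence:
        head = by_idx.get(tok.head)
        if head is None:  # root token, or a head outside the sentence
            continue
        ctype = RULES.get((tok.deprel, head.pos, tok.pos))
        if ctype is not None:
            found.append((ctype, head.word, tok.word))
    return found


def count_collocations(sentences):
    """Aggregate collocation frequencies over a parsed corpus."""
    counts = Counter()
    for sentence in sentences:
        counts.update(extract_collocations(sentence))
    return counts


# Hand-annotated toy parse of "他 认真 学习 汉语" ("He studies Chinese diligently").
SAMPLE = [
    Token(1, "他", "PRON", 3, "nsubj"),
    Token(2, "认真", "ADV", 3, "advmod"),
    Token(3, "学习", "VERB", 0, "root"),
    Token(4, "汉语", "NOUN", 3, "dobj"),
]

if __name__ == "__main__":
    for ctype, head_word, dep_word in extract_collocations(SAMPLE):
        print(ctype, head_word, dep_word)
```

In a real pipeline the tokens would come from a dependency parser rather than hand annotation, and the aggregated counts would typically be filtered (for example by frequency) before being served by a search tool; both steps are outside what the abstract specifies.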
Related resources
Establishing an Argument-Based Validity Approach for a Low-Stake Test of Collocational Behavior
Most of the validation studies conducted across varying test application contexts are usually framed within the traditional conceptualization of validity and therefore lack a comprehensive framework to focus on test score interpretations and test score use. This study aimed at developing and validating a collocational behavior test (CBT), drawing on Kane's argument-based approach to validity. F...
Collocational Processing in Two Languages: A psycholinguistic comparison of monolinguals and bilinguals
With the renewed interest in the field of second language learning for the knowledge of collocating words, research findings in favour of holistic processing of formulaic language could support the idea that these language units facilitate efficient language processing. This study investigated the difference between processing of a first language (L1) and a second language (L2) of congruent col...
The Source of Human Knowledge: Plato’s problem and Orwell’s problem
Chomsky cannot help wondering at the fact that we, despite such vast evidence, have little knowledge about the obvious evidence. A good example, I think, is the child’s way of first language acquisition. A great many researchers have studied various aspects of child language acquisition at different stages of the child’s life and have brought to light many details of language development. However,...
A Dynamical System Approach to Research in Second Language Acquisition
Epistemologically speaking, second language acquisition research (SLAR) might be reconsidered from a complex dynamical system view with interconnected aspects in the ecosystem of language acquisition. The present paper attempts to introduce the tenets of complex system theory and its application in SLAR. It has been suggested that the present dominant traditions in language acquisition research...